SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: TUCKER et al . 

(ii) TITLE OF INVENTION: MORAXELLA CATARRHAL I S OUTER MEMBRANE 
PROTEIN-106 POLYPEPTIDE, GENE SEQUENCE AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS LLP 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/968,685 

(B) FILING DATE: November 12, 1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Baldwin, Geraldine F. 

(B) REGISTRATION NUMBER: 31,232 

(C) REFERENCE/DOCKET NUMBER: 7969-060 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

lie Gly lie Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg 
15 10 15 
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Gly Asp Lys Ser lie Ala lie Gly Asp lie Ala Gin Ala Leu Gly Ser 
20 25 30 



Gin Ser lie Ala lie Gly Asp Asn Lys lie Val 
35 40 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Gly Thr Val Leu Gly Gly Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "probe" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 
GGNACNGTNC TNGGNGGNAA RAAR 24 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .72 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GAA GCG GAC GGG GGG AAA GGC GGA GCC AAT GCG CGC GGT GAT AAA TCC 4 8 
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Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 



ATT GCT ATT GGT GAC ATT GCG CAA 
lie Ala lie Gly Asp lie Ala Gin 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
15 10 15 

He Ala He Gly Asp He Ala Gin 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "probe" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
YTTYTTNCCN CCNAGNACNG TNCC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = ""probe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGNACNGTNT TRGGNGGNAA RAAR 
(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = ""probe"" 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

YTTYTTNCCN CCYAANACNG TNCC 24 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CTATTGACTT AAATC A C CAT AT GGTTATAA TTTAGCATAA TGGTAGGCTT TTTGTAAAAA 6 0 

TCACATCGCA ATATTGTTTT ACTGTTACTA CCATGCTTGA ATGACGATCC AAATCACCAG 12 0 

ATTCATTCAA GTGATGTGTT TGTATACGCA CCATTTACCC TAATTATTTC AATCAAATGC 18 0 

CTATGTCAGC ATGTATCATT TTTTAAGGTA AACCACCATG AATCACATCT AT AAAGT CAT 24 0 

CTTTAACAAA GCCACAGGCA CATTTATGGC CGTGGCGGAA TATGCCAAAT CCCACAGCAC 3 00 

GGGGGGGGGT AGCTGTGCTA CAGGGCAAGT TGGCAGTGTA CGCACTCTGA GCTTTGCCCG 3 60 

TATTGCCGCG CTCGCTGTCC TCGTGATCGG TGCGACGCTC AATGGCAGTG CTTATGCAGG 420 

TATCGGAATT AGTGAAGCAG ACGGGGGAAA AGGCGGAGCC AATGCGCGCG GTGATAAATC 480 

CATTGCTATT GGTGATATTG CTCAGGCACT TGGCTCTCAA TCTATTGCTA TCGGTGACAA 54 0 

CAAAATAGTT CATAATTCAA ATAATAATGC TAATATAGGT GCCAAAGCCT CAGGTAATGA 600 

GTCCATCGCC ATCGGTGGTG ATGTATTGGC TTCTGGTCAT GCCTCGATTG CCATCGGTAG 660 

TGATGACTTA TATTTGAAAA AGGAAACGGT ACAGCAAATC TCAGAGCTTC TACCTATTAT 72 0 

TCGCGGACAG AAAGCATTAA ACGATATATA CCAACTAGCT GACACTAATC TTCAAAAATA 780 

TAGACGCACA CACGCACAGG GACACGCCAG TACTGCAGTG GGAGCCATGT CATATGCAAA 840 

GGGTCATTTT TCCAACGCCT TTGGTACACG GGCAACAGCT GAAGGTACCT ATTCCTTGGC 900 

AGTGGGTCTT ACCGCCACAG CCAAAGCAGC ATCTTCAATC GCTGTTGGTT CTAATGCACA 960 
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AGCTATCGGG TTTGCAGCGA CAGCCGTTGG TGGAAGTACT CAAGTTAATT TGAATCGAGG 1020 

TATTGCCCTA GGTTTTGGTT CTCAGGTCCT TCAGAAGGAT AATGATGTAA ATGCAGCAAA 10 80 

TGTACGGGCC TATGCACCAG ATGATAACCA GCCAATAGAC AACCGGTATA AAGCCACCTT 1140 

CAAGAATGGT GCTACGGATG TATTTTCCAT TGGTAATAGT AATGGGAATG ACAGTATCAG 1200 

GCGTAAAATC ATCAATGTCG GTGCAGGTTC TGCGGATACC GATGCGGTCA ATGTGGCACA 1260 

GCTTAAAGAG GCGGTGAGGC TGGCTAATCG TCAAATTACT TTTAAGGGTG ATGATAGCAA 1320 

TAATAGAGTA GAAAAAGGTT TGGGCAAGAC TTTAACTATC ACAGGTGGTG CACAGACCAG 13 80 

CGCATTAACC GATCATAACA TCGGTGTGGT ACAAAATGGC GATGGTCTGA AAGTTCAACT 1440 

TGCTGAAACT TTAACCAGCC TTAAAATGGT TACCACTGAA AACCTAACCG CCAACGAGAA 1500 

AGTTACCGTA GGCAAAACCC GCCTTACCAC AGATAAAATT GGTTTTACCA ATGATATGAA 1560 

TGGCATTGAT GAAAGCAAAC CTTATCTTGA TAAAGACACT GGCATTCATG CAGGTGGTCA 1620 

AAAGATTACC AAACTTACTG CTGGTGTAGT AGATGACGAT GCGGCAACTT ATGGACAGCT 16 8 0 

TAAAAAAGTT AACCAAACCG CTGAAAGTGC TCTACAAACC TTTACCGTTA AAAAGGTAGA 1740 

TAAAAATGGT AATGATGCTA ATGACAGCAA AATCATCACC GTGGGTAAAA ATAACAAACC 180 0 

AGACGGTACT CAAGTCAACA CCCTAAAACT CAAAGGTGAA AACGGTGTTG ATGTTACAAC 1860 

CGAAACAAAT GGTACAGTTA CCTTTGGGCT TAACCAAAAT AACGGTCTGA CCGTTGGCAA 192 0 

CAGCACCCTA AACAACGATG GCTTATCTGT TAAAAACACC AATAGTAACA AACAAATCCA 1980 

AGTCGGTGCT GATGGCATTA CATTTACTGA TATCAGCAAT AGTAAGCCAG GTGCTGGCAT 2040 

TGAAAATACC ACTCGCATTA CCAGAGACGG TATTGGTTTT GCTAATAATA CTGGTTCATT 2100 

GGATGCAAAC AAACCCCGCC TAACCCCAAC TGGCATTAAC GCAGGTGGTA AAGAGCTGAC 2160 

CAATGTCCAA TCTGC CATTA ACCCTGCTAC CAATGGTGGG CAGCTAGACT TTATGAACCG 2220 

CCTAAGCACT GCTAATACCG AAAAATCAGG CTCTGCCGCC ACCATTAAAG ACT TATAC AA 22 80 

CCTATCACAA GTACCGCTGA CCTTTGCAGG TGATACAGGT CCTAATGTCA CCAAAAAACT 2 34 0 

GGGCGAGATT TTAAAGGTTA AAGGTGGTAA AACCACAGCT GATGATTTAA CCAAAAATAA 2400 

CATCGGTGTG GTGGCTGATA GTACCGATAA TAGCTTAACC GTTAAACTTG CTAAAACTTT 2460 

AAGCGATCTT GATGCGGTTA ATACTAAAAC CCTAACTGCC AGCGATAAAG TT AC CGTAGA 252 0 

CAGTGGCAAC AACACCGCTA AGCTACAAAA TGGTGATTTA ACCTTTAGCA AACAAAATAC 2580 

AGGTGCTACC CCTGCCACCA ACAGCAAAAC CATCTATGGC GTTGATGGCT TGAAGTTTAC 2 64 0 

TGATAACAAT GGTATAGCAC TTGACGGCAC AACTTACATC ACCAAAGACA AAGTTGGCTT 2700 
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TGCTAAGCAA GATGGTTCAC TTGATAAAAG CAAACCTTAT CTTGATAAGG ACAAGCTAAA 2 7 60 

AGTGGGTGAA GTTGAGATTA CCACCAACGG CATTAATGCA GGTGGTAAAG CCATCACAGG 2820 

ACTAAGCAAT ACCCTAACCG ATGCCACCAA CGCAACAACA GGGCATGTAA CTCAATTGGG 28 80 

TATCGTTGAT AGTACTGACA AAACCCGTGC CGCCAGCATT GGTGATGTGC TAAACG C AGG 2940 

CTTTAACCTA AAAAATAATG GTGACGCCAA AGACTTTGTC TCCACTTATG ACACTGTTGA 3 00 0 

TTTTATCAAT GGCAATGCCA CCACCGCTAA AGTCACTTAT GATGGCAAAG CCAGTAAAGT 3 060 

GGCGTATGAT GTCAATGTGG ATGGTACAAC CATTCATCTA ACAGGCGCTG ATGGCAATAA 312 0 

AAACCAAATT GGCGTAAAAA CCACCACACT GACCAAAACA GATGCTAAAG GTGATAAAGC 3180 

AATTAACTTT AGTGTTAACT CTGGTGATGA CAAAGCCCTT ATTAACGCCA AAGACATCGC 3 240 

CGACAATCTA AACACCCTAG CTGGTGAAAT TCGCAACACC AAAGGCACAG CAGACACCGC 3 3 00 

CCTACAAACC TTTCAAGTCA AAAAAGTCAA AGAAAATGGT GATGATGATA ATGACGCTGA 3360 

CACCATCACC GTGGGTAAAG ATGCAAAAAC CAATCAAGTC AACACCCTAA AACTCAAAGG 342 0 

TAAAAACGGT CTTGATATTC AAACCAATAA AGATGGTACG GTTAC CTTTG GCATTAACAC 34 8 0 

CCAAAGCGGT CTTAAAGCCG GCAACAACAC CACTCTAAAC AACAATGGCT TGTCTATTAA 3 54 0 

AAACACCGCT GGTAACGAAC AAATCCAAGT CGGTGCTGAT GGCGTGAAGT TTGCCAAGGT 3 60 0 

TAATAATGGT GTTGTAGGTG CTGGCATTGA TGGCACAACT CGCATTACCA GAGATGAAAT 3 660 

TGGCTTTGCT GGGACTAATG GCTCACTTGA TAAAAGCAAA CCCCACCTAA GCAAAGACGG 372 0 

CATTAACGCA GGTGGTAAAA AGATTAC CAA CATTCAATCA GGTGAGATTG CCCAAAACAG 3 7 80 

CAATGATGCT GTGACAGGCG GCAAGATTTA TGATTTAAAA ACCGAACTTG AAAACAAAAT 3 840 

CAGCAGTACT GCCAAAACAG CACAAAACTC ATTACACGAA TTCTCAGTAG CAGATGAACA 3 900 

AGGTAATAAC TTTACGGTTA GTAACCCTTA CTCCAGTTAT GACACCTCAA AGACCTCTGA 3 96 0 
TGTCATCACC TTTGCAGGTG AAAACGGCAT TACCACCAAG GTAAATAAAG GTGTGGTGCG 
TGTGGGCATT GACCAAACCA AAGGCTTAAC CACGCCTAAG CTGACCGTGG GTAATAATAA 

TGGCAAAGGC ATTGTCATTG ACAGCCAAAA TGGTCAAAAT ACCATCACAG GACTAAGCAA 414 0 

CACTCTAGCT AATGTTACCA ATGATAAAGG TAGCGTACGC ACCACAGAAC AGGGCAAGAT 42 00 

AATCAAAGAC GAAGACAAAA CCCGTGCCGC CAGCATTGTT GATGTGCTAA GCGCAGGCTT 42 60 

TAACTTGCAA GGCAATGGTG AAGCGGTTGA CTTTGTCTCC ACTTATGACA CTGTCAACTT 4 32 0 

TGCCGATGGC AATGCCACCA CCGCTAAGGT GACCTATGAT GACACAAGCA AAACCAGTAA 43 8 0 

AGTGGTCTAT GATGTCAATG TGGATGATAC AACCATTGAA GTTAAAGATA AAAAACTTGG 444 0 



020 

:080 



77 



CGTAAAAACC ACCACATTGA CCAGTACTGG CACAGGTGCT AATAAATTTG CCCTAAGCAA 45 00 

TCAAGCTACT GGCGATGCGC TTGTCAAGGC CAGTGATATC GTTGCTCATC TAAAC AC CTT 4560 

ATCTGGCGAC ATCCAAACTG CCAAAGGGGC AAGCCAAGCG AACAGCTCAG CAGGCTATGT 462 0 

GGATGCTGAT GGCAATAAGG TCATCTATGA CAGTACCGAT AACAAGTACT ATCAAGC C AA 4680 

AAATGATGGC ACAGTTGATA AAACCAAAGA AGTTGCCAAA GACAAACTGG TCGCCCAAGC 474 0 

CCAAACCCCA GATGGCACAT TGGCTCAAAT GAATGTCAAA TCAGTCATTA ACAAAGAACA 4 800 

AGTAAATGAT GCCAATAAAA AGCAAGGCAT CAATGAAGAC AACGCCTTTG TTAAAGGACT 4 860 

TGAAAAAGCC GCTTCTGATA ACAAAACCAA AAACGCCGCA GTAACTGTGG GTGATTTAAA 4 92 0 

TGCCGTTGCC CAAACACCGC TGACCTTTGC AGGGGATACA GGCACAACGG CTAAAAAACT 4 980 

GGGCGAGACT TTGACCATCA AAGGTGGGCA AACAGACACC AATAAGCTAA CCGATAATAA 5 040 

CATCGGTGTG GTAGCAGGTA CTGATGGCTT CACTGTCAAA CTTGCCAAAG ACCTAACCAA 5100 

TCTTAACAGC GTTAATGCAG GTGGTACCAA AATTGATGAC AAAGGCGTGT CTTTTGTAGA 5160 

CTCAAGCGGT CAAGCCAAAG CAAACACCCC TGTGCTAAGT GCCAATGGGC TGGACCTGGG 52 20 

TGGCAAGGTC ATCAGCAATG TGGGCAAAGG CACAAAAGAC ACCGACGCTG CCAATGTACA 52 80 

ACAGTTAAAC GAAGTACGCA ACTTGTTGGG TCTTGGTAAT GCTGGTAATG ATAACGCTGA 534 0 

CGGCAATCAG G TAAAC ATTG CCGACATCAA AAAAG AC C C A AATTCAGGTT CATCATCTAA 54 0 0 

CCGCACTGTC ATCAAAGCAG GCACGGTACT TGGCGGTAAA GGTAATAACG ATACCGAAAA 54 60 

ACTTGCCACT GGTGGTGTAC AAGTGGGCGT GGATAAAGAC GGCAACGCTA ACGGCGATTT 552 0 

AAGCAATGTT TGGGTCAAAA CCCAAAAAGA TGGCAGCAAA AAAGCCCTGC TCGCCACTTA 558 0 

TAACGCCGCA GGTCAGACCA ACTATTTGAC CAACAACCCC GCAGAAGCCA TTGACAGAAT 5 64 0 

AAATGAACAA GGTATCCGCT TCTTCCATGT CAACGATGGC AATCAAGAGC CTGTGGTACA 5 70 0 

AGGGCGTAAC GGCATTGACT CAAGTGCCTC AGGCAAGCAC TCAGTGGCGA TAGGTTTCCA 5760 

GGCCAAGGCA GATGGTGAAG CCGCCGTTGC CATAGGCAGA CAAACCCAAG CAGGCAACCA 5 82 0 

ATCCATCGCC AT CGGTGATA ACGCACAAGC CACAGGCGAT CAATCCATCG CCATCGGTAC 5 8 80 

AGGCAATGTG GTAGCAGGTA AGCACTCTGG TGCCATCGGC GACCCAAGCA CTGTTAAGGC 5 940 

TGATAACAGT TACAGTGTGG GTAATAACAA CCAGTTTACC GATGCCACTC AAACCGATGT 6 0 00 

CTTTGGTGTG GGCAATAACA TCACCGTGAC CGAAAGTAAC TCGGTTGCCT TAGGTTCAAA 6060 

CTCTGCCATC AGTGCAGGCA CACACGCAGG CACACAAGCC AAAAAATCTG ACGGCACAGC 612 0 

AGGTACAACC ACCACAGCAG GTGCAACCGG TACGGTTAAA GGCTTTGCTG GACAAACGGC 618 0 
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GGTTGGTGCG GTCTCCGTGG GTGCCTCAGG TGCTGAACGC CGTATCCAAA ATGTGGCAGC 624 0 

AGGTGAGGTC AGTGCCACCA GCACCGATGC GGTCAATGGT AGCCAGTTGT ACAAAGCCAC 630 0 

CCAAGGCATT GCCAACGCAA CCAATGAGCT TGACCATCGT ATCCACCAAA ACGAAAATAA 6360 

AGCCAATGCA GGGATTTCAT CAGCGATGGC GATGGCGTCC ATGCCACAAG CCTACATTCC 642 0 

TGGCAGATCC ATGGTTACCG GGGGTATTGC CACCCACAAC GGTCAAGGTG CGGTGGCAGT 6480 

GGGACTGTCG AAGCTGTCGG ATAATGGTCA ATGGGTATTT AAAATCAATG GTTCAGCCGA 6540 

TACCCAAGGC CATGTAGGGG CGGCAGTTGG TGCAGGTTTT CACTTTTAAG C C AT AAATCG 6600 

CAAGATTTTA CTTAAAAATC AATCTCACCA TAGTTGTATA AAACAGCATC AGCATCAGTC 6660 

ATATTACTGA TGCTGATGTT TTTTATCACT TAAACCATTT TACCGCTCAA GTGATTATCT 6720 

TTCACCATGA CCAAATCGCC ATTGATCATA GGTAAACTTA TTGAGTAAAT TTTATCAATG 67 80 

TAGTTGTTAG ATATGGTTAA AATTGTGCCA TTGACCAAAA AATTACCGAT TTATCCCGAA 6840 

AATTTCTGAT TATGATCACT TTTCATAAAT TTCCCCAATT TGTCTTTATA AATATCCCAA 690 0 

GAAATGGTAT TATTTTATTG CCATCAGCAT ATGCGACAAC TCATCGTATC ATCTTTTTAT 6 960 

CATAAAAATG CAAATAGGCA TATGCATTTT TTGAATTGAA CTTACGCACT GAGAGATCCC 7020 

CTCATAATTT CCCCAAAGCG TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG 7 080 

CCACGAGTAA GTCTTCCCTT GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA 7140 

AGCGAACTGT TGAGAGTACG TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC 72 00 
AACCTTGTTT CTGAGCATGA ACGCCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA 7260 
TTAGCAGCAT GATATCAAAA CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCCTAGTC 73 20 
CGTAGGCAGG ACTTTTCAAG TCTCGGAAGG TTTCTTCAAT CTGCATTCGC TTCGAATAGA 73 8 0 
TATTAACAAG TTGTTTGGGT GTTCGAATTT CAACAGGTAA GTTAGTTGCT AGAATCCATG 7440 
GCTCCTTTGC CGACGCTGAG TAGATTTTAG GTGACGGGTG GTGACAATGA GTCCGTGTCG 750 0 
AG CGCTG ATT TTTTCGGCCT TTAGAGCGAG ATTTATACAA TAGAATTTGG CATGAGATTG 7560 
GATTGCTTTT AGTCAGCCTC TTATAGCCTA AAGTCTTTGA GTGACTAGAT GACATATCAT 762 0 
GTAAGTTGCT GATAGGTTTC CAGTTTTCCG CTCCTAGGTC TGCATATTGT ACTTTTCCTC 7 680 
TTACTCGACT TAACCAGTAC CAACCCAGCT TCTCAACGGA TTTATACCAT GGCACTTTAA 7740 
AGCCAGCATC ACTGACAATG AGCGGTGTGG TGTTACTCGG TAGAATGCTC GCAAGGTCGG 7 800 
CTAGAAATTG GTCATGAGCT TTCTTTGAAC ATTGCTCTGA AAGCGGGAAC GCTTTCTCAT 7860 
AAAGAGTAAC AGAACGACCG TGTAGTG CGA CTGAAGCTCG C AAT AC CAT A AGCCGTTTTT 792 0 
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GCTCACGGAT 


ATCAGACCAG 


TCAACAAGTA 


CAATGGGCAT 


CGTATTGCCC 


GAACAGATAA 


7980 


AGCTAGCATG 


CCAACGGTAT 


ACAGCGAGTC 


GCTCTTTGTG 


GAGGTGACGA 


TTACCTAACA 


8040 


ATCGGTCGAT 


TCGTTTGATG 


TTATGTTTTG 


TTCTCGCTTT 


GGTTGGCAGG 


TTACGGC CAA 


8100 


GTTCGGTAAG 


AGTGAGAGTT 


TTACAGTCAA 


GTAAGGCGTG 


GCAAGCCAAC 


GTTAAGCTGT 


8160 


TGAGTCGTTT 


TAAGTGTAAT 


TCGGGGCAGA 


ATTGGTAAAG 


AGAGTCGTGT 


AAAATATCGA 


8220 


GTTCGCACAT 


TTTGTTGTCT 


GATTATTGAT 


TTTTGGCGAA 


ACCATTTGAT 


CATATGACAA 


8280 


GATGTGTATC 


TACCTTAACT 


TAATGATTTT 


GATAAAAATC 


ATTAGGGGAT 


TCATCAGACT 


8340 


TACGCATCTT 


TCATTATGGG 


AATTAGGTCA 


GTAATTATGA 


CAAAAAATTA 


TGCATTATTA 


8400 


TCCGTCTCAG 


ATAAAACGCA 


AATCGTTGAA 


TTTGCCCAAG 


GTTTGGTAGA 


ATCTGGCTTT 


8460 


GGTATTTTAT 


CCACAGGTGG 


TACTTTTAAA 


CTCTTAAAAG 


AACATGGGAT 


TGACGCCATT 


8520 


GAGGTTTCTG 


CCCATACAGG 


TTTTGCTGAA 


ATGATGGATG 


GTCGTGTTAA 


GAC CCTACAT 


8580 


CCCAAAATTC ATGGTGGTAT TTTGGGCCGT CGTGGCATTG ATGATGCCAT 


TATGAATGAA 


8640 


CATGGCATTG 


ATCGCATTGA 


TATCGTTGTC 


GTGAATTTAT 


ATCCATTTGC 


CAACACGGTC 


8700 


GCCAAAGACG 


GTGTTGTTAT 


GTCTGATGCG 


ATTGAAAATA 


TTGATATTGG 


TGGGCCTGCT 


8760 


ATGGTACGCT 


CAGCCGCCAA 


AAATCATGCC 


CATGTTGGTA 


TTATCACCAG 


CCCAAATGAC 


8820 


TACTCACGCA 


TCCTAGATGA 


ACTAAAAAAC 


CAAGGTCATT 


TAAGCCACAA 


CACTCGTTTT 


8880 


GATTTGGCAG 


TCAAAGCATT 


TGAACACACT 


GCCGCCTATG 


ATGGTATGAT 


TGCCAGCTGG 


8940 


CTAGGTGCAC 


GCTTACCAGT 


GGATAAAGAG 


ACGGCACCCA 


GTGATGATGC 


CACTGCAACC 


9000 


ACTCAATTTT 


CACGCACTTT 


TAATCACCAA 


TTCACCAAAG 


CACAAGAGCT 


TAGATATGGC 


9060 


GAAAACCCAC 


ATCAGTCAGC 


AGCCTTTTAT 


GTAGATGATC 


ATGCAACAGA 


AGCGTCTGTT 


9120 


GCGACTGCAC 


AGCAATTACA 


AGGTAAAGCG 


TTGTCTTATA 


ATAATATTGC 


TGATACCGAT 


9180 


GCGGCACTTG 


AGTGTGTCAA 


ATCTTTTACC 


ACGCCTGCTT 


GTGTGATTGT 


CAAACATGCC 


9240 


AATCCTTGTG 


GTGTTGCAAC 


ATCAGAAAAC 


GGTATTTTAG 


ATGCTTATCA 


CTTAGCATAT 


9300 


GCAACCGATC 


CTGAATCTGC 


CTTTGGTGGC 


ATTATTGCCT 


TTAACCGAGA 


ATTAGACAGT 


9360 


GATACAGCCC 


GTACCATCGT 


TGAGCGTCAA 


TTTGTTGAAG 


TCATCATCGC 


ACCAAGCATC 


9420 


GCTGAAGGTG 


TTCTAGAGCG 


GCCGCGGGCC 


CATCGATTTT 


CCACCCGGGT 


1 GGGGTACCAG 


9480 


GTAAGTGTAC 


CCAATTCGCC 


CTATAGTGAG 


TCGTATTACA 


. ATTCACTGGC 


1 CGTCGTTTTA 


9540 


CA 












9542 



{2) INFORMATION FOR SEQ ID NO: 10: 



80 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2123 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn His He Tyr Lys Val He Phe Asn Lys Ala Thr Gly Thr Phe 

15 10 15 

Met Ala Val Ala Glu Tyr Ala Lys Ser His Ser Thr Gly Gly Gly Ser 

20 25 30 

Cys Ala Thr Gly Gin Val Gly Ser Val Arg Thr Leu Ser Phe Ala Arg 

35 40 45 

He Ala Ala Leu Ala Val Leu Val He Gly Ala Thr Leu Asn Gly Ser 

50 55 60 

Ala Tyr Ala Gly He Gly He Ser Glu Ala Asp Gly Gly Lys Gly Gly 
65 70 75 80 

Ala Asn Ala Arg Gly Asp Lys Ser He Ala He Gly Asp He Ala Gin 

85 90 95 

Ala Leu Gly Ser Gin Ser He Ala He Gly Asp Asn Lys lie Val His 

100 105 110 

Asn Ser Asn Asn Asn Ala Asn He Gly Ala Lys Ala Ser Gly Asn Glu 

115 120 125 

Ser He Ala He Gly Gly Asp Val Leu Ala Ser Gly His Ala Ser He 

130 135 140 

Ala He Gly Ser Asp Asp Leu Tyr Leu Lys Lys Glu Thr Val Gin Gin 
145 150 155 160 

He Ser Glu Leu Leu Pro He He Arg Gly Gin Lys Ala Leu Asn Asp 

165 170 175 

He Tyr Gin Leu Ala Asp Thr Asn Leu Gin Lys Tyr Arg Arg Thr His 

180 185 190 

Ala Gin Gly His Ala Ser Thr Ala Val Gly Ala Met Ser Tyr Ala Lys 

195 200 205 

Gly His Phe Ser Asn Ala Phe Gly Thr Arg Ala Thr Ala Glu Gly Thr 

210 215 220 

Tyr Ser Leu Ala Val Gly Leu Thr Ala Thr Ala Lys Ala Ala Ser Ser 
225 230 235 240 

He Ala Val Gly Ser Asn Ala Gin Ala He Gly Phe Ala Ala Thr Ala 

245 250 255 

Val Gly Gly Ser Thr Gin Val Asn Leu Asn Arg Gly He Ala Leu Gly 

260 265 270 

Phe Gly Ser Gin Val Leu Gin Lys Asp Asn Asp Val Asn Ala Ala Asn 

275 280 285 

Val Arg Ala Tyr Ala Pro Asp Asp Asn Gin Pro He Asp Asn Arg Tyr 

290 295 300 

Lys Ala Thr Phe Lys Asn Gly Ala Thr Asp Val Phe Ser He Gly Asn 
305 310 315 320 

Ser Asn Gly Asn Asp Ser He Arg Arg Lys He He Asn Val Gly Ala 

325 330 335 

Gly Ser Ala Asp Thr Asp Ala Val Asn Val Ala Gin Leu Lys Glu Ala 

340 345 350 

Val Arg Leu Ala Asn Arg Gin He Thr Phe Lys Gly Asp Asp Ser Asn 

355 360 365 

Asn Arg Val Glu Lys Gly Leu Gly Lys Thr Leu Thr He Thr Gly Gly 
370 375 380 
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Ala Gin Thr Ser Ala Leu Thr Asp His Asn He Gly Val Val Gin Asn 
385 390 395 400 

Gly Asp Gly Leu Lys Val Gin Leu Ala Glu Thr Leu Thr Ser Leu Lys 

405 410 415 

Met Val Thr Thr Glu Asn Leu Thr Ala Asn Glu Lys Val Thr Val Gly 

420 425 430 

Lys Thr Arg Leu Thr Thr Asp Lys He Gly Phe Thr Asn Asp Met Asn 

435 440 445 

Gly He Asp Glu Ser Lys Pro Tyr Leu Asp Lys Asp Thr Gly He His 

450 455 460 

Ala Gly Gly Gin Lys He Thr Lys Leu Thr Ala Gly Val Val Asp Asp 
465 470 475 480 

Asp Ala Ala Thr Tyr Gly Gin Leu Lys Lys Val Asn Gin Thr Ala Glu 

485 490 495 

Ser Ala Leu Gin Thr Phe Thr Val Lys Lys Val Asp Lys Asn Gly Asn 

500 505 510 

Asp Ala Asn Asp Ser Lys He He Thr Val Gly Lys Asn Asn Lys Pro 

515 520 525 

Asp Gly Thr Gin Val Asn Thr Leu Lys Leu Lys Gly Glu Asn Gly Val 

530 535 540 

Asp Val Thr Thr Glu Thr Asn Gly Thr Val Thr Phe Gly Leu Asn Gin 
545 550 555 560 

Asn Asn Gly Leu Thr Val Gly Asn Ser Thr Leu Asn Asn Asp Gly Leu 

565 570 575 

Ser Val Lys Asn Thr Asn Ser Asn Lys Gin He Gin Val Gly Ala Asp 

580 585 590 

Gly He Thr Phe Thr Asp He Ser Asn Ser Lys Pro Gly Ala Gly He 

595 600 605 

Glu Asn Thr Thr Arg He Thr Arg Asp Gly He Gly Phe Ala Asn Asn 

610 615 620 

Thr Gly Ser Leu Asp Ala Asn Lys Pro Arg Leu Thr Pro Thr Gly He 
625 630 635 640 

Asn Ala Gly Gly Lys Glu Leu Thr Asn Val Gin Ser Ala He Asn Pro 

645 650 655 

Ala Thr Asn Gly Gly Gin Leu Asp Phe Met Asn Arg Leu Ser Thr Ala 

660 665 670 

Asn Thr Glu Lys Ser Gly Ser Ala Ala Thr He Lys Asp Leu Tyr Asn 

675 680 685 

Leu Ser Gin Val Pro Leu Thr Phe Ala Gly Asp Thr Gly Pro Asn Val 

690 695 700 

Thr Lys Lys Leu Gly Glu He Leu Lys Val Lys Gly Gly Lys Thr Thr 
705 710 715 720 

Ala Asp Asp Leu Thr Lys Asn Asn He Gly Val Val Ala Asp Ser Thr 

725 730 735 

Asp Asn Ser Leu Thr Val Lys Leu Ala Lys Thr Leu Ser Asp Leu Asp 

740 745 750 

Ala Val Asn Thr Lys Thr Leu Thr Ala Ser Asp Lys Val Thr Val Asp 

755 760 765 

Ser Gly Asn Asn Thr Ala Lys Leu Gin Asn Gly Asp Leu Thr Phe Ser 

770 775 780 

Lys Gin Asn Thr Gly Ala Thr Pro Ala Thr Asn Ser Lys Thr He Tyr 
785 790 795 800 

Gly Val Asp Gly Leu Lys Phe Thr Asp Asn Asn Gly He Ala Leu Asp 

805 810 815 

Gly Thr Thr Tyr He Thr Lys Asp Lys Val Gly Phe Ala Lys Gin Asp 

820 825 830 

Gly Ser Leu Asp Lys Ser Lys Pro Tyr Leu Asp Lys Asp Lys Leu Lys 
835 840 845 
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Val Gly Glu Val Glu lie Thr Thr Asn Gly lie Asn Ala Gly Gly Lys 

850 855 860 

Ala lie Thr Gly Leu Ser Asn Thr Leu Thr Asp Ala Thr Asn Ala Thr 



865 



875 



Thr Gly His Val Thr Gin Leu Gly He Val Asp Ser Thr Asp Lys Thr 

885 890 895 

Arg Ala Ala Ser He Gly Asp Val Leu Asn Ala Gly Phe Asn Leu Lys 

900 905 910 

Asn Asn Gly Asp Ala Lys Asp Phe Val Ser Thr Tyr Asp Thr Val Asp 

915 920 925 

Phe He Asn Gly Asn Ala Thr Thr Ala Lys Val Thr Tyr Asp Gly Lys 

930 935 940 

Ala Ser Lys Val Ala Tyr Asp Val Asn Val Asp Gly Thr Thr He His 
945 950 955 960 

Leu Thr Gly Ala Asp Gly Asn Lys Asn Gin He Gly Val Lys Thr Thr 

965 970 975 

Thr Leu Thr Lys Thr Asp Ala Lys Gly Asp Lys Ala He Asn Phe Ser 

980 985 990 

Val Asn Ser Gly Asp Asp Lys Ala Leu He Asn Ala Lys Asp He Ala 

995 1000 1005 

Asp Asn Leu Asn Thr Leu Ala Gly Glu He Arg Asn Thr Lys Gly Thr 

1010 1015 1020 

Ala Asp Thr Ala Leu Gin Thr Phe Gin Val Lys Lys Val Lys Glu Asn 
025 1030 1035 1040 

Gly Asp Asp Asp Asn Asp Ala Asp Thr He Thr Val Gly Lys Asp Ala 

1045 1050 1055 

Lys Thr Asn Gin Val Asn Thr Leu Lys Leu Lys Gly Lys Asn Gly Leu 

1060 1065 1070 

Asp He Gin Thr Asn Lys Asp Gly Thr Val Thr Phe Gly He Asn Thr 

1075 1080 1085 

Gin Ser Gly Leu Lys Ala Gly Asn Asn Thr Thr Leu Asn Asn Asn Gly 

1090 1095 1100 

Leu Ser He Lys Asn Thr Ala Gly Asn Glu Gin He Gin Val Gly Ala 
105 1110 1115 1120 

Asp Gly Val Lys Phe Ala Lys Val Asn Asn Gly Val Val Gly Ala Gly 

1125 1130 1135 

He Asp Gly Thr Thr Arg He Thr Arg Asp Glu He Gly Phe Ala Gly 

1140 1145 1150 

Thr Asn Gly Ser Leu Asp Lys Ser Lys Pro His Leu Ser Lys Asp Gly 

1155 1160 1165 

He Asn Ala Gly Gly Lys Lys He Thr Asn He Gin Ser Gly Glu He 

1170 1175 1180 

Ala Gin Asn Ser Asn Asp Ala Val Thr Gly Gly Lys He Tyr Asp Leu 
185 1190 1195 1200 

Lys Thr Glu Leu Glu Asn Lys He Ser Ser Thr Ala Lys Thr Ala Gin 

1205 1210 1215 

Asn Ser Leu His Glu Phe Ser Val Ala Asp Glu Gin Gly Asn Asn Phe 

1220 1225 1230 

Thr Val Ser Asn Pro Tyr Ser Ser Tyr Asp Thr Ser Lys Thr Ser Asp 

1235 1240 1245 

Val He Thr Phe Ala Gly Glu Asn Gly He Thr Thr Lys Val Asn Lys 

1250 1255 1260 

Gly Val Val Arg Val Gly He Asp Gin Thr Lys Gly Leu Thr Thr Pro 
265 1270 1275 1280 

Lys Leu Thr Val Gly Asn Asn Asn Gly Lys Gly He Val He Asp Ser 

1285 1290 1295 

Gin Asn Gly Gin Asn Thr He Thr Gly Leu Ser Asn Thr Leu Ala Asn 
1300 1305 1310 
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Val Thr Asn Asp Lys Gly Ser Val Arg Thr Thr Glu Gin Gly Lys lie 

1315 1320 1325 

He Lys Asp Glu Asp Lys Thr Arg Ala Ala Ser He Val Asp Val Leu 

1330 1335 1340 

Ser Ala Gly Phe Asn Leu Gin Gly Asn Gly Glu Ala Val Asp Phe Val 
345 1350 1355 1360 

Ser Thr Tyr Asp Thr Val Asn Phe Ala Asp Gly Asn Ala Thr Thr Ala 

1365 1370 1375 

Lys Val Thr Tyr Asp Asp Thr Ser Lys Thr Ser Lys Val Val Tyr Asp 

1380 1385 1390 

Val Asn Val Asp Asp Thr Thr He Glu Val Lys Asp Lys Lys Leu Gly 

1395 1400 1405 

Val Lys Thr Thr Thr Leu Thr Ser Thr Gly Thr Gly Ala Asn Lys Phe 

1410 1415 1420 

Ala Leu Ser Asn Gin Ala Thr Gly Asp Ala Leu Val Lys Ala Ser Asp 
425 1430 1435 1440 

He Val Ala His Leu Asn Thr Leu Ser Gly Asp He Gin Thr Ala Lys 

1445 1450 1455 

Gly Ala Ser Gin Ala Asn Ser Ser Ala Gly Tyr Val Asp Ala Asp Gly 

1460 1465 1470 

Asn Lys Val He Tyr Asp Ser Thr Asp Asn Lys Tyr Tyr Gin Ala Lys 

1475 1480 1485 

Asn Asp Gly Thr Val Asp Lys Thr Lys Glu Val Ala Lys Asp Lys Leu 

1490 1495 1500 

Val Ala Gin Ala Gin Thr Pro Asp Gly Thr Leu Ala Gin Met Asn Val 
505 1510 1515 1520 

Lys Ser Val He Asn Lys Glu Gin Val Asn Asp Ala Asn Lys Lys Gin 

1525 1530 1535 

Gly He Asn Glu Asp Asn Ala Phe Val Lys Gly Leu Glu Lys Ala Ala 

1540 1545 1550 

Ser Asp Asn Lys Thr Lys Asn Ala Ala Val Thr Val Gly Asp Leu Asn 

1555 1560 1565 

Ala Val Ala Gin Thr Pro Leu Thr Phe Ala Gly Asp Thr Gly Thr Thr 

1570 1575 1580 

Ala Lys Lys Leu Gly Glu Thr Leu Thr He Lys Gly Gly Gin Thr Asp 
585 1590 1595 1600 

Thr Asn Lys Leu Thr Asp Asn Asn He Gly Val Val Ala Gly Thr Asp 

1605 1610 1615 

Gly Phe Thr Val Lys Leu Ala Lys Asp Leu Thr Asn Leu Asn Ser Val 

1620 1625 1630 

Asn Ala Gly Gly Thr Lys He Asp Asp Lys Gly Val Ser Phe Val Asp 

1635 1640 1645 

Ser Ser Gly Gin Ala Lys Ala Asn Thr Pro Val Leu Ser Ala Asn Gly 

1650 1655 1660 

Leu Asp Leu Gly Gly Lys Val He "Ser Asn Val Gly Lys Gly Thr Lys 
665 1670 1675 1680 

Asp Thr Asp Ala Ala Asn Val Gin Gin Leu Asn Glu Val Arg Asn Leu 

1685 1690 1695 

Leu Gly Leu Gly Asn Ala Gly Asn Asp Asn Ala Asp Gly Asn Gin Val 

1700 1705 1710 

Asn He Ala Asp He Lys Lys Asp Pro Asn Ser Gly Ser Ser Ser Asn 

1715 1720 1725 

Arg Thr Val He Lys Ala Gly Thr Val Leu Gly Gly Lys Gly Asn Asn 

1730 1735 1740 

Asp Thr Glu Lys Leu Ala Thr Gly Gly Val Gin Val Gly Val Asp Lys 
745 1750 1755 1760 

Asp Gly Asn Ala Asn Gly Asp Leu Ser Asn Val Trp Val Lys Thr Gin 
1765 1770 1775 
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Lys Asp Gly Ser Lys Lys Ala Leu Leu Ala Thr Tyr Asn Ala Ala Gly 

1780 1785 1790 

Gin Thr Asn Tyr Leu Thr Asn Asn Pro Ala Glu Ala He Asp Arg He 

1795 1800 1805 

Asn Glu Gin Gly He Arg Phe Phe His Val Asn Asp Gly Asn Gin Glu 

1810 1815 1820 

Pro Val Val Gin Gly Arg Asn Gly He Asp Ser Ser Ala Ser Gly Lys 
825 1830 1835 1840 

His Ser Val Ala He Gly Phe Gin Ala Lys Ala Asp Gly Glu Ala Ala 

1845 1850 1855 

Val Ala He Gly Arg Gin Thr Gin Ala Gly Asn Gin Ser He Ala He 

I860 1865 1870 

Gly Asp Asn Ala Gin Ala Thr Gly Asp Gin Ser He Ala He Gly Thr 

1875 1880 1885 

Gly Asn Val Val Ala Gly Lys His Ser Gly Ala He Gly Asp Pro Ser 

1890 1895 1900 

Thr Val Lys Ala Asp Asn Ser Tyr Ser Val Gly Asn Asn Asn Gin Phe 
905 1910 1915 1920 

Thr Asp Ala Thr Gin Thr Asp Val Phe Gly Val Gly Asn Asn He Thr 

1925 1930 1935 

Val Thr Glu Ser Asn Ser Val Ala Leu Gly Ser Asn Ser Ala He Ser 

1940 1945 1950 

Ala Gly Thr His Ala Gly Thr Gin Ala Lys Lys Ser Asp Gly Thr Ala 

1955 1960 1965 

Gly Thr Thr Thr Thr Ala Gly Ala Thr Gly Thr Val Lys Gly Phe Ala 

1970 1975 1980 

Gly Gin Thr Ala Val Gly Ala Val Ser Val Gly Ala Ser Gly Ala Glu 
985 1990 1995 2000 

Arg Arg He Gin Asn Val Ala Ala Gly Glu Val Ser Ala Thr Ser Thr 

2005 2010 2015 

Asp Ala Val Asn Gly Ser Gin Leu Tyr Lys Ala Thr Gin Gly He Ala 

2020 2025 2030 

Asn Ala Thr Asn Glu Leu Asp His Arg lie His Gin Asn Glu Asn Lys 

2035 2040 2045 

Ala Asn Ala Gly He Ser Ser Ala Met Ala Met Ala Ser Met Pro Gin 

2050 2055 2060 

Ala Tyr He Pro Gly Arg Ser Met Val Thr Gly Gly He Ala Thr His 
065 2070 2075 2080 

Asn Gly Gin Gly Ala Val Ala Val Gly Leu Ser Lys Leu Ser Asp Asn 

2085 2090 2095 

Gly Gin Trp Val Phe Lys lie Asn Gly Ser Ala Asp Thr Gin Gly His 

2100 2105 2110 

Val Gly Ala Ala Val Gly Ala Gly Phe His Phe 
2115 2120 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 
Gly Thr Val Leu Gly Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Gly lie Gly lie Ser Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala 

15 10 15 

Arg Gly Asp Lys Ser lie Ala lie Gly Asp lie Ala Gin Ala Leu Gly 

20 25 30 

Ser Gin Ser lie Ala lie Gly Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 

15 10 15 

He Ala He Gly Asp He Ala Gin 
20 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Ala Arg Gly Cys Asn Gly Ala Tyr Gly Gly Asn Gly Gly Asn Ala 

15 10 15 

Ala Arg 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr Thr Gly Asn Gly Cys Asp Ala Thr Arg Thr Cys Asn Cys Cys Asp 

15 10 15 

Ala Thr 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAAGCGGACG GGGGGAAA 

(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.17: 
TTGCGCAATG TCACCAAT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



87 



GAAGCGGACG GGGGGAAAGG CGGAGCCAAT GCGCGCGGTG ATAAATC CAT TGCTATTGGT 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Ala Asp Gly Gly Lys Gly Gly Ala Asn Ala Arg Gly Asp Lys Ser 
15 10 15 

lie Ala lie Gly Asp lie Ala Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCATCATTGG AAAACGTTCT TCGGGGCGAA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



CGGTCAGCTT AGGCGTGGTT 



